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Abstract. In this paper we introduce a family of planar, modular and 
self-similar graphs which have small- world and scale- free properties. The 
main parameters of this family are comparable to those of networks as- 
sociated to complex systems, and therefore the graphs are of interest 
as mathematical models for these systems. As the clustering coefficient 
of the graphs is zero, this family is an explicit construction that does 
not match the usual characterization of hierarchical modular networks, 
namely that vertices have clustering values inversely proportional to their 
degrees. 

1 Introduction 

Research and studies performed in the last few years show that many networks 
associated with complex systems, like the World Wide Web, the Internet, tele- 
phone networks, transportation systems (including power and water distribu- 
tion networks), social and biological networks, belong to a class of networks 
now known as small-world scale-free networks, see [1,11] and references therein. 
These networks exhibit a small average distance and diameter (with respect to a 
random network with the same number of nodes and links) and, in many cases, a 
strong local clustering (nodes have many mutual neighbors). Another important 
common characteristic is that the number of links attached to the nodes usually 
obeys a power-law distribution (is scale-free). By introducing a new measuring 
technique, it has recently been discovered that many real networks are also self- 
similar, see [15,16]. Moreover, a degree hierarchy in these networks is sometimes 
related to the modularity of the system which they model. 
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Most of the network models considered are probabilistic, however in recent 
years some deterministic models have been proposed which are very often based 
on iterative constructions such that, at each step, one or more vertices are con- 
nected to certain subgraphs (for example, the so called k-trees [4]). Another 
technique produces graphs by duplication of certain substructures, see [5]. Here 
we propose a new family of graphs which generalize these methods by introducing 
at each iteration a more complex substructure than a single vertex. The result 
is a family of planar, modular, hierarchical and self-similar graphs, with small- 
world scale-free characteristics and with clustering coefficient zero. We note that 
some important real life networks, for example the networks associated to elec- 
tronic circuits or Internet [11], have these characteristics as they are modular, 
almost planar and with a reduced clustering coefficient and have small-world 
scale-free properties. Thus, these networks can be modeled by our construction. 
A related family of graphs based on triangles, and which therefore has a high 
clustering coefficient was introduced in [19]. 

2 Hierarchical modular graphs 

Several authors classify as hierarchical graphs, graphs with a modular structure 
and a strong connectedness hierarchy of the vertices which produces a power-law 
degree distribution. Moreover, they consider that the most important signature 
of hierarchical modularity is given by a clustering distribution with respect to the 
degree according to C(k) ocl/k, see [2,8,17]. In this section we define an analyze 
a family of hierarchical modular graphs, which are scale-free, planar and have 
clustering coefficient zero. They prove the existence of hierarchical graphs which 
do not have the above-mentioned relationship between the clustering coefficient 
and the degrees of the corresponding vertices. 

Deterministic models for simple hierarchical networks have been published 
in [12,13]. These models consider the recursive union of several basic structures 
(in many cases, complete graphs) by adding edges connecting them to a selected 
root vertex. These and other hierarchical graphs have been considered when 
modeling metabolic networks in [10,14]. Hierarchical modularity also appears in 
some models based on fc-trees or clique-trees, where the graph is constructed by 
adding at each step one or more vertices and each is connected independently to a 
certain subgraph [8,6,18]. The introduction of the so-called hierarchical product 
of graphs in [7] allows a generalization and a rigorous study of some of these 
models. 

In [15,16], Song, Havlin and Makse relate the scale- free and the self-similarity 
properties as they verify that many self-similar graphs associated to real life 
complex systems have a fractal dimension and provide a connection between 
this dimension and the exponent of the degree power-law. However, a classical 
scale- free mode, the preferential attachment by Barbasi- Albert [1], which many 
authors consider a paradigm for these networks, has a null fractal dimension. 
This is not a paradox as the Barabasi- Albert model lacks modularity because of 
its generation process based on the individual introduction of vertices. 



In the next subsection we give details of our construction which is also based 
on an iterative process. However, the introduction at each step of a certain 
substructure allows the formation of modules and results in a final graph with 
a self-similar structure. 

2.1 Iterative algorithm to generate the graph H(t) 

The graph H(t) is constructed as follows: For t = 0, H(Q) is C\, a length four 
cycle. We define now as generating cycle a cycle C4 whose vertices have not been 
introduced at the same iteration step and passive cycle a cycle C4 which does 
not verify this property. For t > 1, H (t) is obtained from H{t — 1) by considering 
all their generating cycles C4 and connecting, vertex to vertex, to each of them 
a new cycle C4. This operation is equivalent to adding to the graph a cube Q3 
by identifying vertex to vertex the generating cycle with one of the cycles of Q3. 
The process is repeated until the desired graph order is reached. 




Fig. 1. Graphs H(t) produced at iterations t = 1, 2 and 3. 

2.2 Recursive modular construction 

The graph H(t) can be also defined as follows: For t = 0, H(0) is the cycle C4. 
For t > 1, H{t) is produced from four copies of H{t — 1) by identifying, vertex 
to vertex, the initial passive cycle of each H(t—l) with each of four consecutive 
cycles of Q3 (leaving two opposite cycles of Q3 free), see Fig. 2. 

2.3 Properties of H(t) 

Order and size of H(t). — We use the following notation: V(t) and E{t) denote, 
respectively, the set of vertices and edges introduced at the step t and C(t) is 



Fig. 2. Modular construction of H(t) for t = 1, 2 y 3. At step t, we merge four copies 
of H(t — 1) to four cycles of the cube Qs, leaving opposite cycles free. See the text for 
details. 



the number of generating cycles C4 at this step (which will be used to produce 
the graph H(t + 1). 

Note that at each iteration, any generating cycle is replaced by four new 
generating cycles and one passive cycle. Therefore: C(t + 1) = 4 ■ C(t),t > 1 
and C(0) = 1. Thus C(t) — 4*. Moreover, each generating cycle introduces at 
the next iteration four new vertices and eight new edges. As a consequence, 
V(t) = 4 • C(t - 1) = 4 • 4*- 1 and E(t) = 8 • C(t - 1) = 8 ■ 4*" 1 = 2-4*, thus: 

* 4*+i 1 e * o . 4*+l 1 4 

m*)i = E^(*) = ^ 1 - toi = E^) = — 3^- 



Step Vertices 


Edges 


Number of active cycles 


4 
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1 
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3 
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Table 1. Number of vertices, edges and generating cycles of H(t) at each step. 



Degree distribution. — Intially, at t = 0, the graph is a single generating cycle 
C4 and its four vertices have degree two. 

When a new vertex i is added to the graph at iteration ij (ij > 1), it has 
degree 3. We denote by C(i,t) the number of generating cycles at iteration t 



which will produce new vertices that will connect to vertex i at step t + 1. At 
iteration i,;, when vertex i is introduced, the value of C(i,U) is 2. According 
to the construction process of the graph, at each iteration, each new neighbor 
of i belongs to two generating cycles where i is also a vertex. If wc denote as 
k(i,t) the degree of vertex i at step i, then we have the following relationship: 
C(i,t) = k(i,t) - 1. 

We now compute C{i,t). As we have seen above, each generating cycle where 
i belongs to, produces two new generating cycles which also have i as a vertex. 
Thus C(i,t) — 2 • C(i,t — 1). Using the initial condition C(i,U) — 2, we have 
C(i,t) — 2 t ~ ti+1 . Therefore the degree of vertex i at the step t is 

k(i,t) = 2 t ~ u+1 + 1. (2) 

Note that the initial four vertices of step follow a different process. In this 
case C(i,0) = 2 f and k(i,t) = 2* + 1. Thus, at step t the initial four vertices of 
the graph have the same degree than those introduced at step 1. 

From equation (2) we verify that the graph has a discrete degree distri- 
bution and we use the technique described by Newman in [11] to find the 
cumulative degree distribution P cum (fc) for a vertex with degree k: P cum (k) = 

Er<u \V(T)\/\V(U)\ = (4*'* 1 + 8)/(4* +1 + 8). 

Replacing tj, from equation (2), in the former equation = t+1— ln(fc — 1) /In 2 
we obtain P cum (k) = (16 • 4* • (k - 1)~ 2 + 8)/(4* + 8), which for large values of 
t, allows us to write P CU m(fc) ~ k 1 ~ lk = k~ 2 , and therefore the degree distribu- 
tion, for large graphs, follows a power-law with exponent 7^ = 3. Research on 
networks associated to electronic circuits (these networks show planarity, mod- 
ularity and a small clustering coefficient) gives similar values for their degree 
power-law distribution [9,11]. More precisely, the largest benchmark considered 
-a network with 24097 nodes, 53248 edges, average degree 4.34 and average dis- 
tance 11.05- has a degree distribution which follows a power-law with exponent 
3.0, precisely the same as in our model, and it has a small clustering coefficient 
C = 0.01. 
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Fig. 3. Log- log representation of the cumulative degree distribution for H(10) with 
|y| = 1398104 vertices. The reference line has slope —2. 



Diameter. — At each step we introduce, for each generating cycle, four new 
vertices which will form a new cycle C4 (and these vertices are among them 
at maximum distance 2). As all join the graph of the former step with one 
new edge, in the worst situation the diameter will increase by exactly 2 units. 
Therefore D t = D t -i + 2. t > 2. As D\ = 3, we have that the diameter of H{t) 
is D t = 2-t + 1 if t > 1. 

Average distance. — The average distance of H(t) is defined as: 

dt = Y d id , (3) 

|v(t)|(|v(t)|-i)/2 . .f-< J 

where dij is the distance between vertices i and j. We will denote as S t the sum 

The modular recursive construction of H (t) allows us to calculate the exact 
value of dt. At step t, H(t+1) is obtained from the juxtaposition of four copies of 
H (t), which we label Hf , <p = 1, 2, 3, 4, on top of the cube Q3 (see Figs. 2 and 4). 
The copies are connected one to another at the vertices which we call connecting 
vertices and we label w, x, y, z, o, r, s, and a. The other vertices of H (t + 1 will 
be called interior vertices. Thus, the sum of distances distance S t +i satisfies the 
following recursion: 

S t+ i = A S t + A t -A. (4) 

where A t is the sum over all shortest paths whose endvertices are not in the 
same H(t) copy and the last term compensates for the overcounting of some 
paths between the connecting vertices -for example, d(w, o) is included both in 
H\ and H?-. Note that the paths that contribute to A t must all go through 
at least one of the eight connecting vertices. The analytical expression for A t 
is not difficult to find. We denote as A^ the sum of all shortest paths with 




Fig. 4. Illustration of the classification of nodes in Hf , ip = 1,2, 3, 4. 



endvertices in H" and iff. A"' 13 excludes the paths such that either endvertex 
is a connecting vertex. Then the total sum A t is 

A t = A 1 / + A]' 3 + A\ A + A 2 ' 3 + A 2A + A 3A + 20 

+ X! {d w ,i + d ,i) + ^2 {d x ,i + d r ,i) 

ien^uHf, ieffiuff*, 
i^x,r,s,y : a :Z i^w ,o,a,z,s,y 

+ (d s ,i + d y ,i)+ { d a,i+d z ,i), (5) 

where the term 20 comes from the sum of d WiS , d w , y , d . s , d a , y , d Xja , d x . z , d 7%a , 
and d TiZ . 

By symmetry, A)' 2 = A) A = A 2,3 = A 3A , A) 3 = A 2 ' 4 , and J2i d w ,i = 
d ,i — d Xy i — d>r,i — d s ^ = y^ ; - d y ^ = y^ t - d ai i = y^ 7 - (iz^, and 

A = 4A 1 ' 2 + 2A 1 ' 3 + 8 J] ( 6 ) 

To calculate A t , we classify the interior vertices of H(t+ 1) into four different 
classes according to their distances to each of the four vertices w, x, y, and z. 
Vertices w, x, y, and z are not classified into any of these classes which we 
represent as Pi, P2, P3, and P4, respectively. This classihcation is represented 
in Fig. 4. By construction, for an arbitrary interior vertex v, there must exist 
one of the above mentioned vertices (say w) satisfying d VjW < d v _ x , d VtW < d v>y , 
and d VtW < d v>z . All the interior vertices nearest to w (resp. x, y, and z) are 
assigned to class Pi (resp. Pi, P3, and P4). The total number of vertices of H t 
that belong to the class P T (r = 1,2,3,4) is denoted by N tt p T . Since the four 
vertices w, x, y, and z play a symmetrical role, classes Pi, P2, P3, and P4 are 
equivalent. Thus, N t .p 1 = N t ,p 2 — N t ,p 3 — N t .p 4 which will be abbreviated to 
N t from now on. We have 

lv;i-4 4* - f 

4 3 W 

We denote by L t +i,p 1 {Lt+i,p 2 , Lt+i,p 3 , L t +i.p 4 ) the sum of distances between 
vertices w (x, y, z) and all interior vertices v <E Pi (P2, P3, P4) of H(t+1). Because 
of the symmetry, L t +i,p 1 — L t +i.p 2 — L t +i.p 3 ~ L t+ i i p 4 that will be written as 
L t +i for short. Taking into account the second method of constructing H{t), see 
Fig. 4, we can write the following recursive formula for L t +i: 

L t+ i=AL t + 2N t + l. (8) 

We can solve Eq. (8) inductively, with initial condition Li = 1, and we have 



L t = -L ( 3t . 4* + 2 • 4* - 2) . 



(9) 



We now return to compute Eq. (6), with Z\*' 2 given by the sum 



ueHl,u({vj,o,a,z}; »=1 j = l 

v £ f , v ^ { w , x , r , o } 

where P,*' 1 and P^' 2 are the vertex classes Pi and P, of H} and P 2 , respectively, 
and dpt.i p t,2 is the sum of distances d u , v for all vertices u E Pi C P* and 

w G P, C Hi 
We have: 

d p t,i p t,2 = ^ d UyV = ^ ( d u,o + d 0tr + d rtV ) = 2N t L t + AT 2 . (11) 
vePiCH^ vePiCHf 

Following the same process, we obtain d p t,i p t,2 for the different values of i and 
j, which we use in Eq. (10) giving: 



' 3 



A 1 / = 32 N t L t + 24 (N t ) 2 . (12) 

Analogously, we can obtain 

A]' 3 = 32 N t L t + 32 (N t ) 2 . (13) 

Now, to find an expression for A t , the only thing left is to evaluate the last 
term of Eq. (6), which can be obtained as above 

8 J2 d Wti = 64 L t + 128 N t . (14) 

Finally, and combining the former expressions, we write the exact result for 
the average distance of H(t), d t , as 

- 4 10 + 14 -4* + 3(4+1)16* 

d* = — • ; ; • (15) 

3 10 + 13 -4* + 4 -16* v ' 

Notice that for a large order (t — ► oo) dt — t + 1 ~ In | V* | , which means that the 
average distance shows a logarithmic scaling with the order of the graph, and 
has a similar behavior as the diameter (the graph is small- wor Id) . 

Strength distribution. — The strength of a node in a network is associated 
to resources or properties allocated to it, as the total number of publication of 
an author, in the case of the network associated to the Erdos number; the total 
number of passengers in the world-wide airports network, etc. 

In our case we associate to each vertex the area of the passive cycle, defined 
by the four vertices introduced at a given step. For this purpose we assume a 
uniform construction of the graph. At the initial step the area is Ao and we 



denote as At the area of the passive cycle introduced at step t. By convention, 
we establish that the area of this cycle is one fifth of the area of the cycle where it 
connects (as each introduction of a passive cycle is associated to the simultaneous 
introduction of four generating cycles). Therefore we have At = (g)*^4o- A vertex 
i introduced at U will have strength s(i,ti) = (^Y'Ao and it will keep it i n 
further steps t > ti. As we want to find the strength distribution for all vertices 
of the graph at step t, we have that s(i,ti) = {\) ti ~ t ■ A t - 

Using equation (2) we obtain the following power-law for the correlation 
between the strength and the degree of a vertex: 

s (M)-^ t «M)-l) ln5/1 " 3 , (16) 

which for large values of the degree k leads to s(k) ~ fc ln5 / ln3 . 

We should mention that similar exponents have been found for the relation 
between the strength and the degree of the node of real life networks like the 
airports network, Internet and the scientist collaboration graph [3]. 

After a similar analysis to the calculation of the degree distribution, we find 
that the strength distribution also follows a power law with exponent: 

* + (IT) 

It has been shown that if a weighted graph with a non-linear correlation be- 
tween strength and degree s(k) ~ k 13 and the degree and strength distributions 
follows power laws, P(k) ~ fc~ 7fc and P(s) ~ s~ 7s , then there exists a general 
relationship between y 7s given by ~f s — + ff- [3] . 

As in our case 7^ = 3 y (3 — In 5/ In 3 from the former relationship, the 
exponent of the strength distribution is 7 S = 3j^-| + ln2(j^-| — l)/ln5, and we 
obtain the same value j s (17) which was computed directly. 



3 Conclusion 



The family of graphs introduced and studied here has as main characteristics pla- 
narity, modularity, degree hierarchy, and small-world and scale-free properties. 
At the same time the graphs have clustering zero. A combination of modularity 
and scale-free properties is present in many real networks like those associated 
to living organism (protein-protein interaction networks) and some social and 
technical networks [13,14]. The added property of a small clustering coefficient 
appears also in some technological networks (electronic circuits, Internet, P2P) 
and social networks [11]. Therefore our model, with a null clustering coefficient, 
could be considered to model these networks and also it can be used to study 
other properties without the influence of the clustering. The deterministic char- 
acter of the family, as opposed to usual probabilistic models, should facilitate 
the exact computation of many network parameters. 

On the other hand, simple variations of our model allow the introduction 
of clustering. As an example, by adding to each passive cycle an edge we can 



introduce two triangles for each cycle and therefore obtain a planar graph with 
non-zero clustering. Replacing in the construction each passive cycle by a com- 
plete graph K4 will produce a family with a relatively large clustering coefficient. 
However the graph will no longer be planar. 
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